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ESTIMATION OF CLINICAL CUT-OFFS 

I 

FIELD OF TBDE INVENTION 

[0001] The present invention concerns methods and systems for improving the 
accuracy of predicting resistance of a disease to a drug! More specifically, the invention provide 
metiiods for assessing the impact of pre-existmg variations in drug susceptibiUty, whetiier 
naturally occurring or selected by previous drug exposure, on treatment response in order to 
establidi clinically relevant cut-oflf values for phenotypic or genotypic resistance tests. 

BACKGROUND OF THE INVENTION 

[0002] Techniques to determine the resistance of a disease to a drug are 
becoming increasingly important. Since tiie issuance of the first report suggesting a correlation 
between the ranergence of vhral resistance and clinical progression, techniques to determine the 
resistance of a pathogen to a drug have been increasingly incorporated into clinical studies of 
flierapeutic regimens (see Brendan Larder et al., HIV Resistance and Implications for Therapy 
(1998), herein incorporated by reference). For example, as with viral infections, some studies 
show that p53 mutations may also be predictive of tumour response to specific anticancer drug 
therapy, radiation treatment or gene therapy. This is the case in breast cancer where initial 
studies have shown that cisplatin and tamoxifen are more effective in patients whose tumours 
have a p53 mutation. Thus, the aim of resistance monitoring is to provide the necessary 
infonnation to enable the physician to prescribe the most optimal combination of drugs for the 
individual patient. 
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[0003] With more therapeutic options becoming available over time, resistance 
testing is expected to play an important role in the management and treatment of disease and the 
development of individualized treatment regimes [see e.g. Haulbrich et al JAIDS, 2001,2681, 
S51-S59]. 

[0004] Furthermore, the number of drug resistant diseases is also increasing. 
Phenotyping methodologies measure the ability of a pathogen to grow in the presence of 
different drugs in the laboratory. This is usually expressed as the fold change in the IC50 or IC90 
values (the IC50 or IC90 value being the drug concentration at which 50% or 90% respectively of 
the population of pathogen is inhibited from repHcating). For example, a highly resistant virus 
might show a 50 or even 100-fold increase in IC50, for example. Some viral mutations only 
increase the IC50 by as Uttle as 2-3 fold. On the other hand, a pathogen may exhibit 
hypersensitivity towards a given drug. For example, it has been demonstrated that a combination 
of HIV mutations may lead to hypersensitivity of the pathogen towards a given drug. 

[0005] Unlike genotyping, phenotyping is a direct measure of susceptibility, 
reflectmg the effects and interactions of all the mutations, known or unknown, on flie behaviour 
of the pathogen population in the presence of a drug. 

[0006] The utility of drug susceptibility phenotyping is dependent on the "cut- 
off' value of the fold increase in, for example, the IC50 at which a pathogen is considered 
resistant. The term "cut-off value", as used herein, refers to the threshold change in susceptibility 
above which a pathogen is classed as having reduced susceptibility for a particular drug. Drug 
"resistance", as used herein, pertains to the capacity of resistance, sensitivity, susceptibility or 
effectiveness of flie drug against the pathogen. 

[0007] There has been recent debate regarding the relevance of some cut-off 
values currently in use. For example, for viral infections, certain groups currently use technical 
cut-off values, which are usually the same value for each drug-tested and are determined not by 
clinical criteria but, for example, by the assay variability seen on repetitive testing of a single 
wild type standard virus. By repeatedly running a test with the standard reference virus, the 
reproducibility of the test is measured and a cut-off is set at this level, (e.g., a 2,5-fold increase in 
IC50). This provides a cut-off that depends largely on the analytical performance of the assay. 
This approach suffers from the limitation that it does not consider the population-based variation 
in drug responsiveness. In addition, such an approach does not account for different 
responsiveness towards different drug regimens. The limitations of setting a single cut-off for all 
available drugs m this way is that it tells the clinician very little about the significance of any 
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change in susceptibiUty reported by a test Indeed, some virological cut-off values are clearly out 
of line with known response data. For example, indications of low level resistance to non- 
nucleotide reverse transcriptase inhibitors (NNRTIs) does not lead to blunted responses to drugs 
in previously untreated individuals (Hairigan et al, Bacheler et al, 4th International Workshop 
on HIV Drug Resistance and Treatment Strategies, Sitges, Spain. Abstr. (2000)). Other assays 
have cut-off values that are primarily based on the repioducibiUty of the assay, are the same for 
each drug, or are not related to whether a drug might work against the pathogen in clinical 
practice and are, therefore, ratiier arbitrary. 

[0008] Methods have akeady been described to develop more meaningful, 
biologically relevant cut-off values for drugs used in HIV therapy. For example, Virco measured 
the ICso values for isolates from 1,000 untreated patients as well as many thousands of samples 
of HIV-1 with no resistance mutations. The average and the range of susceptibility were 
calculated for each drug. The cut-offs were then set at two standard deviations above the mean. 
This statistical term means that a test result falling above the cut-off can be said to be above the 
normal susceptible range with 97.5% confidence (Harrigan et al. World-wide variation in HIV-1 
phenotypic susceptibility in untreated individuals: biologically relevant values for resistance 
testing. 2001. AIDS 15:1671-1677). Since the susceptibility of untreated and un-mutated virus 
varied considerably from drug to drug, the predicted biological cut-offs are different for each 
drug. 

[0009] The use of biological cut-offs has changed the amount of resistance 
being reported for HIV. For example, the biological cut-off values for the dideoxynucleoside 
analogues are lower than the cut-offs used previously and. in a study of 5,000 random clinical 
samples, revealed a higher and more reaKstic incidence of resistance. Conversely, the cut-offs for 
the non-nucleoside reverse transcriptase inhibitors are higher than those previously used. 

[00101 However, although the biological cut-off values are a vast improvement 
to the arbitrary cut-offs used previously, there are still disparities between these predicted 
thresholds and the observed fold-resistance above which a cUnical response is actually reduced. 
There is thus a great need for a method that can estabUsh cut-off fold change resistance values 
that are clinically-relevant. 

(001 U The present invention provides a solution to these problems, in the fonn 
of new methods for assessing the impact of pre-existing variations in drug susceptibility, whether 
naturally occurring or selected by previous drug exposure, on treatment response in order to 
establish clinically relevant cut-off values for phenotypic or genotypic resistance tests. 
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SUMMARY OF THE INVENTION 

[0012] According to the invention, there is provided a diagnostic method for 
estimating for a patient the treatment response of a disease caused by a pathogen to a drug, the 
method comprising: comparing the fold change resistance value of the pathogen infecting the 
patient to a clinical cut-ofiF value which is the fold change resistance value at which a clinically 
relevant variation of clinical response is observed; wherein the clinical cut-off value is 
established by modelling the clinical response of a population of patients treated with the drug to 
the disease caused by the pathogen as a function of the fold change resistance of the pathogen 
infecting the patients. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0013] Figure 1 a: Example of the structure of a clinical data base used in the 
present invention. 

[0014] Figure lb: Example analysis dataset for d4T. 

[001 5] Figure 2a: Example of linear regression curve showing censored and 
uncensored observations, where log viral load drop is modelled as a function of baseline fold 
change resistance. 

[0016] Figmre 2b: Example of linear regression curve of 8 week viral load 
response as a function of baseline phenotypic resistance for nucleoside(tide) RT inhibitors. 

[001 7] Figure 2c: Example of linear regression curve of 8 week viral load 
response as a function of baseline phenotypic resistance for boosted and unboosted protease 
inhibitors. 

[0018] Figure 2d: Example of change in viral load vs. baseline fold change 
for the NNRTI of neviripine (NVP). 

[0019] Figure 2e: Example of change in viral load vs. baseline fold change 
for efavirenz (EFV). 

[0020] Figure 2f: Dmg Effect plotted as % response as a Function of 
Baseline Resistance for nucleoside(tide) inhibitors 

[0021] Figure 2g: Dmg Effect plotted as % response as a Function of 
Baseline Resistance for boosted and im-boosted Protease Inhibitors 

[0022] Figure 3a: Example of linear regression curve for TNF, where log 
viral load drop is modelled as a function of baseline fold change resistance and a first definition 
of clinical cut-off is applied. 
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[0023] Figure 3b: Example of linear regression curve for d4T, where viral 
load drop is modelled as a function of fold change and a second definition of clinical cut-off is 
applied. 

[0024] Figure 4a: Example of linear regression curve for TDF, where log 
viral load drop is modeUed as a function of baseKne fold change resistance and a second 
definition of clinical cut-off is applied. 

[0025] Figure 4b: Ecample of lower and upper cutoff determined using 
definition 2 for lopinavir/r if viral load is modeled using linear regression. 

[0026] Figures 4c: Example of lower and upper cutoflfe detennined using 
definition 2 for AZT if viral load is modelled using linear regression. Curve shows the change in 
viral load vs. the fold change. 

[0027] Figure 4d: Example of lower and upper cutofife determined using 
definition 2 for AZT if viral load is modelled using linear regression. Curve shows the % loss of 
reference response vs. the fold change. 

[0028] Figure 5a: Example of logistic regression curve for TNF, where 
probability of failure is inodelled as a function of baseline fold change resistance and a second 
definition of clinical cut-off is applied. 

[0029] Figure 5b: Example of lower and upper cutoffs determined using 
definition 2 for lopinavir/r if the failure rate is modelled using logistic regression. 

[0030] Figure 6: Example of linear regression cxxrve for TNF, where log 
vural load drop is modelled as a function of baseline fold change resistance and a third definition 
of clinical cut-off is applied. 

[0031] Figure?: Example of logistic regression curve for TNF, where 
probability of failure is modelled as a function of baseline fold change resistance and a third 
definition of clinical cut-off is applied. 

[0032] Figure 8a: Example of classification tree for TNF. This gives results 
of the same order as the linear and logistic regression methodologies. 

[0033] Figure 8b: Example of classification tree for 3TC. 

[0034] Figure 8c: Example of classification tree for TDF assuming that the 
cost of classifying a failed regimen as a success is 1.5 times the cost of classifying a successfiil 
regimen as a failure. 

[0035] Figure 9: Example of initial validation of preliminary 
VirtualPhenotypQ™ CUnical Cutoffs for nuclesides(tides) and boosted and unboosted protease 
inhibitors by bootstrapping with 90% confidence intervals. 
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DETAILED DESCIPTION OF ILLUSTRATIVE EMBODIMENTS 

[0036] According to the invention, a threshold fold-resistance is established, 
above which a disease is classified as being resistant to a drug in a clinical context. The method 
models treatment response of the pathogen causing the disease to a particular drug as a function 
of baseline pathogen load, baseline resistance, baseline activity of co-administered drugs targeted 
to the pathogen and treatment history. By "baseline pathogen load" is meant the pathogen load of 
the patient measured at the start of treatment by the drug. By "baseline fold change resistance" is 
meant the fold change resistance to the candidate drug exhibited by the pathogen infecting the 
patient at the start of treatment. By "baseline activity of co-administered drugs targeted to the 
pathogen" is meant the activity against the pathogen of each drug administered in combination 
with the drug for which the treatment response is being modeled. By "treatment historj^' is meant 
the previous drug exposure of the patient (and therefore, the pathogen). 

[0037] In a preferred embodiment, the cut-off value is determined as a function 
of treatment response data in treated subjects, considering baseline pathogen load, baseline fold 
change resistance, baseline activity of co-administered dmgs targeted to the pathogen, and 
treatment history. 

[0038] This method thus provides a prediction of clinical outcome at different 
levels of baseline resistance. According to this methodology, treatment outcome (drop in 
pathogen load and response rate) is modeled by dmg as a function of baseline fold change 
resistance as determined by reference to a system that measures dmg resistance phenotype or 
predicts dmg resistance phenotype from pathogen genotj^e (such as VirtualPhenotype®, Virco). 
The models take into account effects of co-administered dmgs, baseline pathogen load and, 
optionally, treatment history in order to avoid any bias introduced by imbalances of clinically- 
important characteristics. From the model, a prediction of outcome can be made at different 
levels of the baseline fold change resistance of the pathogen. 

[0039] Using this methodology, fold change resistance values obtained by 
comparison of genotype with phenotype (for example, VirtualPhenotype®) are linked with 
clinical outcome. This is a unique approach; other research groups use different approaches 
whereby particular mutations or actual phenotype results are linked with clinical outcome. 

[0040] The methodology of the invention is advantageous over those currently 
used. For example, conventional approaches do not fully account for the population-based 
variability in dmg sensitivity. In the present method, the population may include treatment naive 
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and treatment experienced patients, and may be a mixed population which is not restricted to, for 
example, a single gender, age, race or sexual behaviour. 

[0041] The method of the invention also accounts for the different 
responsiveness in a population towards different drugs. The drug-specific clinical cut-off values 
detemiined by this approach are more reliable parameters in estimating resistant over sensitive 
strains of pathogen. 

[0042] The method also allows clinical cut-offs established usmg the method to 
be re-calculated depending on the type of population studied, i.e. a paediatric population may 
have a different clinical cut-off for a particular drug than the adult population for the same dmg. 

[0043] Of particular importance, this methodology allows the determination of 
clinical cut-offs for all marketed dmgs in a vmiform, scientific maimer on a substantial database 
using data derived firom response to combination therapy. Currently available cut-offs are 
determined by reference to a limited amount of data and may be inconsistent as they are 
detemiined using different approaches. 

[0044] According to the invention, clinical cut-off values are established by 
modeling the clinical response of a population of patients treated with the drug to the disease 
caused by a particular pathogen as a fimction of the fold change resistance of the pathogen 
infecting the patients, The fold change resistance for a pathogen may be established using 
methods known in the art. Briefly, the sensitivity of a patient sample for a particular drug is 
compared with the sensitivity of a reference sample for that same drug. This may be done by a) 
determining the sensitivity of a patient sample for the drug; b) determining the sensitivity of a 
reference sample for the drug; and c) determining the patient fold change resistance firom the 
quotient of the sensitivity obtained in step a) over the sensitivity obtained in b). Examples of 
preferred methods for performing these steps are described in detail in co-pending applications 
WOOl/79540 and WO02/33402. Equivalent methods will be apparent to the person of skill in the 
art. 

[0045] In a preferred embodiment of the invention, the cut-off fold change 
resistance value is calculated by reference to the log of the pathogen load drop. In such a method, 
a linear regression analysis is preferably performed using a set of treatment response data firom 
subjects harbouring the pathogen, wherein the log pathogen load drop LogPL dropu for the 
pathogen infecting a patient z, is modelled as the sum of all of the individual contributions for 
factors that influence pathogen load drop, according to the following equation: 

LogPLdrop. = yff^, + fi^LogiBaselinePL;) + fi^^iPSS,) + fi^Q./ FC,^ + 
-7- 
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10046] In this equation, BaselinePU represents the pathogen load of the patient 
measured at the start of treatment by the drug. 

10047] PSSi is a phenotypic sensitivity score representing the number of active 
drugs in the background treatment regimen for the patient, excluding the drug whose 
contribution to treatment response is being modelled. 

[0048] FCi is a baseline fold change resistance. 

[0049] po is the intercept. 

[0050] p, is a coefficient representmg the increase in log pathogen load drop per 
unit increase of the log of the BaselinePU. In the case of HIV and HCV infection, baseline PLiis 
readily quantified by validated commercial assays. 

[0051] P2 is a coefficient indicating the increase in log pathogen load drop per 
unit increase of the number of sensitive drugs in the background treatment regimen. 

[0052] p3 is a coefiRcient indicating the increase in log pathogen load drop per 
unit increase of the inverse of FCt. The value of this coefficient is part of the output of the 
described model. 

[0053] Si, is an error term which represents the difference between the modelled 
prediction and the experimentally det^mined measurement. 

[0054] The coefBcients in the linear regression model may be calculated using a 
computer analysis package such as PROC LIFEREG. PROC LIFEREG is a procedure within the 
SAS (Statistical Analysis System) software which performs linear regression on censored data. 
By default, the LIFEREG Procedure confutes initial values for the parameters using ordinary 
least squares (OLS) ignoring censoring. The log-Ukelihood function is maximized by means of a 
ridge-stabUized Newton-Raphson algorithm. 

[0055] PSSi, the phenotypic sensitivity score, represents the number of active 
drugs in the background treatment regimen for the patient, as predicted from pathogenic 
genotype by YirtualPheaotype^ or other algorithms or as measured by actual phenotype testing. 
The puipose of this term is to allow a drug-specific value to be extracted from treatment response 
data that has been collected for a patient that has received a combination of drugs. In this way, 
resistance data relevant solely to the particular drug under investigation is extracted. The other 
drugs are considered the background regimen; this may be different for different patients. It is 
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necessary to analyze patients with different background regimens together as there would not be 
enough data to do a sound analysis otherwise. 

[0056] During this analysis it has to be taken into account that different 
background regimens influence the clinical outcome in a different way. In order to do this, the 
activities of background drugs are summarised, by determining the number of active drugs, and 
thus devising a PSS (preferably judged as active according to VirtualPhenotype®). The PSS is 
then included in the model. 

[0057] In a preferred embodiment, the PSS may be calculated based on 
preliminary clinical cut-offs which are determined as described. The concept of PSS is discussed 
in detail by DeGmttola et ah (Antiviral Therapy 2000; 5:41-48). In addition, the concept of 
contmuous PSS as a variation of PSS is discussed by Bosch et ah (AIDS 2003, 17:1-9); 
Katzenstein et al (AIDS 2003; 17:821-830); andHaubrich et al CTDelavirdine 
Hypersusceptibility (DLV HS): Virological Response and Phenotypic Cut-Points - Results from 
ACTG 359"; 1 1th Conference on Retroviruses and Opportunistic Infections held on 8-1 1 
February 2004 in San Francisco, CA, USA). The PSS may be determined by an iterative process 
such that the cut-off value is refined to a constant value. In subsequent iterations of the model, 
PSS scores based on preliminary clinical cut-offs defined in the first iteration of the model may 
be utilized. 

[0058] FCi, the baseline fold change, is equivalent to baseline fold change 
resistance. These terms are used interchangeably herein. This is a patient-specific temi and is 
determined based on a dmg susceptibility phenotype test or predicted based on the genotype of 
the pathogen infecting a particular patient The phenotype exhibited by the pathogen of this 
genotype may be predicted in a number of ways; generally, such techniques compare the 
genotype to phenotype data collected from a group of patients infected with a pathogen of 
smiilar genotype. However, this does not change the fact that this fold change resistance is a 
characteristic of the specific pathogenic strain infecting an individual patient at baseline. 

[0059] For example, prediction of baseline fold change resistance may exploit 
rules-based or other less direct systems of determining the drug resistance phenotype of a 
pathogen. An example of a less direct system is the Virtual Phenotype (Virco, Inc.; 
PCT/EPO 1/04445). Prediction of baseline fold change resistance may alternatively use other 
systems for determining phenotype from genotype information, such as neural networks that 
determine the drug resistance phenotype of a pathogen based on its genotypic infomiation (see, 
for example, U. S. Patent Application No. 09/589,167; PCT/EPO 1/06360. The neural network 
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may be used to identify mutation(s) or mutation patterns that confer resistance to a drug and 
defines the genetic basis of drug resistance. 

[0060] po, the intercept, is the estimated log pathogen load drop for a reference 
group i.e. a theoretical group of patients with a baseline pathogen load of one, an infinite fold 
change resistance and no sensitive drugs in the background. The purpose of this term is to 
improve the model fit. If it was not included, the fitted curve would be forced to pass through the 
origin (zero Log PL drop at zero fold change resistance), which could lead to an unrealistic 
model. 

[0061] The error term, c/. represents the difference between the modelled 
prediction and the experimentally determined measurement i.e. the difference between the actual 
response of the patient and Hie predicted response. As more data are added to the model, 
additional factors that are relevant to the determination of clinical cut-off values will be added. 
This will improve the model fit and therefore the error of the prediction will decrease. All the p 
terms are estimated simultaneously by minimizing Ihe error term. 

[0062] In this methodology of this embodiment of the invention, censoring 
(pathogen loads beyond the assay range caused by the detection limits of pathogen load kits) 
affects the results and therefore procedures that take censoring into account are preferably 
applied. Preferably, censored values are dealt with by attempting to construct a model that is 
consistent fi-om extrapolations. TTiis model is appUcable to any described methodology. 
Censored values are thus modelled by replacing the censored value by a maximum likelihood 
estimation, assimiing knowledge of the standard deviation of the measurement error. For 
example, censored values may be dealt with using the PROC LIFEREG pre-programmed 
procedure in the statistical analysis package SAS that performs analyses with censored values. 

[0063] An advantage of the linear regression method described above is that 
quantitative data about changes in pathogen load can be studied because parogen load is 
considered as a continuous variable. This therefore takes into account the maximum amount of 
information present in the data. Estimates are corrected for covariates in the model (for example, 
background regimen) and therefore, do not suffer firom imbalances in the covariates. Conclusions 
are limited to patients with covariates that are represented in the dataset in the clinical response 
database. 

[0064] Oth^ baseline characteristics may be added to flie linear regression if 
relevant, resulting in the addition of new terms in the equation given above. Examples of 
additional baseline characteristics include the total duration of flie previous treatment, and the 
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time at which treatments were administered. For example, estimates can be corrected for 
duration by adding a term P4(Duration) in the model equation given above. 

[t^t'^S] Furthermore, additional factors may be taken into account, including 
sensitivity score per drug class (in addition to the overall sensitivity score of the background 
treatment: cPSS), previous exposure to the dmg (naive: Yes or No; naive to protease inhibitors: 
Yes or No; naive to nucleotide RT Inhibitors: Yes or No; and so on). Further examples will be 
. clear to those of skill in the art. 

[0066] A quadratic term for the cPSS may be added to the model. 

[0067] The fold change resistance may be transformed before putting it into the 
model. For example, a power transformation ranging from FC"^ to FC* may be performed on the 
fold change. 

[0068] Accordingly, a more general form of the equation presented above may 
be expressed as: 

LogPLdrop, = fii,-^ fi,LogiBaselinePL,) + fi^{cPSS,} + fi^icPSSt)^ -¥ fi^iFC^^^ fi„iH„) + s, 

wherem p is a power transformation (e.g. ranging from -3 to 1) and H5 to Hn are treatment history 
parameters (e.g. naiVe to antiretroviral therapy, naive to NRTI treatment, etc. ...) or parameters 
describing the background therapy as a ftmction of a certain therapeutic class (e.g. the number of 
active NRTFs taken concomitantly with the drug under investigation). 



[0069] An example of characteristics of analysis datasets (8 week outcome) for 
individual drugs are in the following form: 





Range (Drug) 


Median Baseline Viral Load (log) 


3.32 (TDF) - 4.71 (boosted IDV) 


Median background cPSS 


L34 (ddC) - 2.58 (LPV/r) 


# regimens including the dmg 


24 (unboosted APV) - 1551 (3TC) 


% from cohort data 


21% (unboosted APV) - 83% (ddl-EC) 


% with no resistance mutations 


14.5% (boosted APV) - 75% (EFV) 



[0070] As stated above, the clinical cut-offs determine the fold change 
resistance with a diminished predicted clinical response to dmg. In an alternative to merely 
classifying pathogens as sensitive or resistant, the method of this aspect of the invention 
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preferably incorporates three classifications, namely "sensitive", associated with maximum 
response to drug therapy, "intermediate", associated with reduced, but still significant response 
to drug therapy, and "resistant", associated with little if any response to drag therapy. For 
example, by one set of definitions relevant for HIV response, "sensitive" may be classified as a 
predicted pathogen load drop of more than about 0.6 logs, "intermediate resistance" may be 
classified as a predicted pathogen load drop of between about 0.2 and about 0.6 logs and 
**resistant" may be classified as a predicted pathogen load drop of less than about 0.2 logs. In 
another set of definitions, "sensitive" may be classified as a predicted pathogen load drop of 
between about 0.5 logs and 1.0 logs. Cut-offs calculated using these definitions are highly 
dependant on covariates. 

[0071] In a further preferred embodiment of the invention, the cut-off fold 
change resistance value is calculated by reference to the probability of the pathogen being 
susceptible to treatment by the drug for the patient, herein termed Prob of success. In such a 
method, Prob of success is preferably calculated by performing a logistic regression analysis 
using data firom a clinical response dataset, wherein Prod of success is modelled according to the 
following equation: 

Prob of 

success = exp(^o + /3,Log(BaselmePL.) + (PSS,) -h J3, (1/ FC,)) 

(l + exp(>ffo P.LosiBaselinePLi)^ JB^iPSSg)^ JS^(1/ J^C,)}) * 

[0072] The terms in the equation are the same as those described above for the 
embodiment of the invention described above. 

[00731 This method of logistic regression does not suffer jfrom the censoring 
problem described above for the linear regression model. Furthermore, the probability of success 
is an intuitive way of interpreting clinical outcome. One disadvantage is that by classifjdng the 
pathogen load into successes and failures, part of the information of the continuous variable 
pathogen load is lost. 

[0074] Estimates may also be corrected for covariates as for linear regression. 

[0075] Again, like the method of the first described embodiment of the 
invention, the method of second described embodiment also preferably incorporates the three 
classifications, sensitive, intemiediate resistant and resistant. On the basis that the maximum 
effect is defined as the treatment effect at a fold change resistance of approximately 1 fold 
change or the fold change demonstrated by wild type viral isolates from patients, and the 
minimmn effect is defined as the treatment effect at a very high fold change resistance (i.e. when 
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the curve reaches a plateau), the "effect range? is the difference between the maximum effect and 
the minimum effect The maximum effect may be defined as the treatment effect at fold change 
resistance of between about 0.7 and about 1,2 fold change resistance. 

[0076] Preferably, a "sensitive" genotype is classified as a predicted treatment 
effect of more than about 78-85% of the effect range. Preferably, "intermediate resistant" is 
classified as a predicted treatment effect of between about 15-25% and about 75-85% of the 
effect range. Preferably, "resistant" is classified as a predicted treatment effect of less than about 
15-25% of the effect range. Cut-offs calculated using this method are less dependent on 
covariates than the method described earlier which uses predicted pathogen load drops. 
However, the effect range will vary for different covariates. 

[0077] With this approach, two cutoffs per drag are identified: a "lower" cutoff 
which represents the fold change at which the response begins to be lost, and an •*upper" cutoff 
which represents the fold change at which the response is essentially gone. 

[0078] The lower and upper cutoffs may be defined as the fold change with 
expected log viral load drops of >0.6 and <0.2 respectively. 

[0079] The lower and upper cutoffs may be defined as the fold change 
associated with an expected 20% and 80% decrease respectively of the reference activity of the 
drug within the regimen. 

[0080] Accordingly, a first definition (definition 1) of lower and upper cutoffs 
are the fold changes with expected log viral load drops of >0.6 and <0.2 respectively. 

[0081] A second definition (definition 2) of the lower and higher cutoffs are the 
fold changes associated with an expected 20% and 80% decrease respectively of the reference 
activity of the drag within the regimen. 

[0082] A third definition (definition 3) of the cut-off is the fold change that 
most optimally distinguished between patients with successful and imsuccessful treatments. 

[0083] In one embodiment of the invention, definition 1 is applied for Tenofovir 
on a population taking two active drags besides tenofovir and with a baseline Log(Vl) of 4, gives 
an predicted drop in log viral load of -0.6 at fold change 3.73. 

[0084] If definition 2 is applied on the linear regression model, the predicted 

drop in log viral load may be -1.48 at fold change 1, and -0.28 at the maximum fold change. 

Therefore the effect range may be -0.28 + 1 .48 =1 .2. 20% of this effect range was observed at 

fold change 5 (and this value was considered as the upper clinical cut-off value). 80% of the 
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effect range was observed at fold change 1.25 (and this value was considered as flie lower 
clinical cut-off value). 

[0085] For this embodiment, when the FC of patient is 0,8 (below the lower 
clinical cut-off), a normal clinical response is predicted. If the FC of the patient is 2 (above the 
lower cUnical cut-off and below the upper clinical cut-ofiO» a reduced clinical response is 
predicted. If the FC is 7 (above the clinical cut-off), then the clinical response is predicted as 
being minimal. 

[0086] In an embodiment when definition 2 is applied, and the logistic model is 
used for tenofovir, a lower cut-off at 1.2 FC and a higjier cut-off at 3.81 FC are determined. 

[0087] The following table depicts a number of embodiments of the invention 
for tenofovir for a population with 2 active dmgs in the regimen and a baseline Log(VL) of 4. 



Definition of 






Linear 
Regression 




Logistic 
Regression 


Classification 
Tree 


Clinical 
cut-oflf 


Population 


Properties of the subgroup 


Lower 
CO 


Higher 
CO 


Lower 
CO 


Higher 
CO 


Lower 
CO 


Higher 
CO 


Definition 1 


Subgroup 1 


PSS=2, baseline Log(VL) = 4 


3.73 


> assay limit 


NA 


NA 


NA 


NA 




Subgroup 2 


PSS=0, baseline Log(VL) = 4 


L68 


3.8 


NA 


NA 


NA 


NA 




Subgroup 3 


PSS=«2, baseline Log(VL) « 5 


> assay limit 


> assay limit 


NA 


NA 


NA 


NA 




Overall 




NA 


NA 


NA 


NA 


NA 


NA 


Definition 2 


Subgroup I 


PSS=2, baseline Log(VL) = 4 


1.2S 


5 


L2 


3.81 


NA 


NA 




Subgroup 2 


PSS=0, baseline Log(VL) = 4 


1.25 


5 


1.16 


3.36 


NA 


NA 




Subgroup 3 


PSS=2, baseline Log(VL) = 5 


1.25 


5 


1.17 


3.4 


NA 


NA 




Overall 




1,25 


5 


NA 


NA 


NA 


NA 


Definition 3 


Subgroup 1 


PSS=2, baseline Log(VL) = 4 


1.1 


5 


1.2 


3.81 


1.15 


NA 




Subgroup 2 


PSS=0» baseline Log(VL) « 4 


1.1 


5 


1,2 


3.36 


1.15 


NA 




Subgroup 3 


PSS=2, baseline Log(VL) = 5 


1.1 


5 


1.2 


3.4 


1,15 


NA 




Overall 




l.l 


5 


1.2 


NA 


1.15 


NA 


NA: Not Appltcable 





[0088] In a further embodiment, applying definition 1 on the linear regression 
model for patients with a log baseline viral load of 5 and all patients taking two active drugs in 
addition to d4T(stavudine), a viral load drop of more than 0.6 log copies/mL for any fold change 
of d4T is predicted. The viral load drop is predicted to be -0.6 logs and -0.2 logs at fold changes 
2.6 and 4.0 for patients with a log baseline viral load of 5 and taking no active drugs in addition 
to d4T. 

[0089] In another embodiment, lower and upper cutoffs predicted using 
definition 2 for lopinavir/r are 8 and 69 respectively for the whole population if viral load is 
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modeled using linear regression, and ttie lower and upper cuttofs are 11 and 64, 10 and 60, and 9 
and 58 respectively for populations with log baseline VL/background PSS of 4/2, 5/0 and 5/2 
respectively if the failure rate is modeled using logistic regression. 

[0090] In another embodiment, lower and upper cutoffs determined using 
definition 2 for boosted saquinavir for the logistic model are 1.7 and 13.2, and 1.7 and 12.9 
respectively for populations with log baseline viral load/phenotypic sensitivity score for the 
background regimen of 4/2 and 5/0 respectively. In the same circumstances lower and upper 
cutoffs by linear regression for saquiavir/r are 1.6 and 12.3 respectively for the whole population, 

[0091] The following table shows a number of embodiments modelled using a 
preliminary linear regression analysis and definion 2: 



1 

DRUG 


VIROLOGIC RESPONSE | 


Baseline FC for 20% 
REDUCTION of 
response(iower 
cutofjf) 


Baseline FC for 80% 
REDUCTION of 
response (upper 
cutoff) 


AZT zidovudine 


1 .8 ri .5-2.5] 


17 [10-25] 


3TC lamivudine 


1.1 [1.1-1.2] 


2.6 t1 .9-4.61 


D4T stavudine 


1.3 [1 .2-1.4] 


3.4 [3.1-3.61 


Ddi didanosine 
(extended release) 


1.3 [1 .2-1.9] 


3.6 [2.8-4.9] 


ABC abacavir 


1.6 n .1-2,61 


5.8 [1 .7-7.4] 


TDF Tenofovir 


1,2 ri .1-1.51 


2.5 [1.7-3.8] 


■ IDV indinavir 


1.2 [1.1-1.9] 


3.4 n .9-1 6.41 


IVD/r indinavir/r 


3.5 ri. 1-8.4] 


25 [1 .8-31] 


NFV nelfinavir 


1.1 ri.1-1.3] 


2.2 [1 .7-5.3] 


SQV saquinavir 


1.1 M .1-2.1] 


2.0 [1 .7-18] 


SQV/r Saquinavir/r 


1 .6 [1 .3-4.8] 


12 [5.8-27] 


AMP annprenavir 


1.2 ri. 1-2.41 


3.4 [1.7-10.21 


AlVIP/r amprenavir/r 


1.5 ri .2-2.6] 


6.8 [3.6-10.51 


LPV/r Loplnavir/r 


6.9 [2.1-17.41 


56 [29-671 1 



[0092] The values between brackets in the table are the 90% confidence limits 
as detennined by bootstrapping. 

[0093] In another embodiment, lower clinical cutoffs modelled using definition 
2 after 8 weeks were 1.1 to 1.2 for unboosted Pis, and upper clinical cutoffs were 2.0- 3.4 for 
unboosted Pis. Clinical cutoffs for boosted Pis were higheri lower CCO - 1.5 -6.9 and upper 
ceo 6.8-56. 

[0094] In an example for D4T the linear regression model comprises: 
VLdrop = 2.91 - 0.63Log(BaselineVL) - 1.66(FC^-^) - 0.99(cPSS) + 
0.15(cPSS') - 0.18(PSS[NRTI]) + 0.91(NRTI[natve]). 
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[0095] The coefficients in the linear regression model were calculated using 
PROC LIFEREG. 

[0096] ITie ]SRTI[naive] value represents whether the patient is naive to 
nucleoside RT inhibitors. If yes, value =1, if no, value =0. The PSS[NRTI} represents the 
phenotypic sensitivity score for NRTIs, i.e the number of active NRTIs in the background 
regimen for the patient. 

[0097] When Log(BaselineVL) = 4, lower reference fold change =0.9 and upper 
reference fold change=3, cPSS=2, PSS[NRTri=l and NRTI[naive]=0, the modelled lower and 
upper CCOs are modeled as 1.1 and 2.2 respectively using definition 2. 

[0098] In another example, the linear regression model is applied to an 8 week 
viral load response and modelled as a function of baseline phenotypic resistance. The subjects 
may have a log baseline viral load = 4.0 and a cPSS score for the backgroimd regimen of 2.0, 

[0099] In a further preferred embodiment of the invention, the cut-off fold 
change resistance value is calculated by constructing a classification tree in order to classify the 
likelihood of a patient having an undetectable pathogen load after treatment with a particular 
drag, as a success or a failure. This methodology constructs tree-structured rules in order to 
classify patients as successes (undetectable pathogen load after treatment) and failures. For 
example, for a virus an imdetectable pathogen load could be defined as a viral load of less than 
400 viral copies per ml. Such a classification tree has the advantage that it is very visual and easy 
to interpret, although it suffers firom the limitation that the decisions do not take into account the 
value of certain other relevant parameters. Imbalances for such parameters may therefore 
influence the decision taken for a certain parameter. However, such trees provide insights into 
the importance of several parameters and this can be helpful in the fitting process of the linear 
regression and logistic regression approaches described above. 

[0100] The classification tree poses queries, in which the answer to each query 
results in either the left or the right branch of the tree being taken at each stage. For example, the 
first query may preferably consider the fold change resistance of the pathogen genotype to the 
^ drug in question e.g. is fold change for the dmg TDF (tenofovir) < 1.35? If yes, the left branch is 
taken, if no the right branch is taken. As with the methods of the aspects of the invention 
described previously, the other factors queried include the log baseline pathogen load and the 
phenotypic sensitivity score. The numbers at the termini of the final branches represent the 
response rate (1 = 100% response). Examples of classification trees according to the invention 
are provided in Figures 8a, b and c. 
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[0101] In this embodiment of the invention, the clinical cut-off is defined as the 
fold change resistance threshold value that makes the best distinction between successful and 
unsuccessful treatments i.e. the most suitable value posed in the query that bifiircates the tree 
into the left and right branches. The population is thus split into two subgroups: one with a high 
success rate and one with a low success rate. The clinical cut-off is chosen as the fold change 
that makes the difference between the two groups as large as possible. 

[0102] Preferably, two or all three of the methods of the above-described 
embodiments of the invention are performed for each dataset and candidate drug. The clinical 
cut-offs can in this manner be calculated for each of the approaches. From the analysis results, 
the most appropriate values for lower and higher cut-offs are selected, taking into account the 
advantages and the disadvantages of the separate approaches. This selection will only be made if 
the results of the approaches are consistent or if possible inconsistencies can be explained. If 
there are unexplained inconsistencies between the results, it can be concluded that more data 
need to be gathered before a clinical cut-off can be determined. 

[0103] For example, if the results of the different approaches are consistent 
(preferably clinical cut-off difference < 0.5) then the predictions are deemed to be consistent. If 
the results differ more than that, the disparities need to be explained. For example, if we suppose 
that the population contains 90% censored values and the linear regression gives a clinical cut- 
off of 0.9 while the logistic regression gives a clinical cut-off of 3.5, then in this example the 
linear regression results are less reliable because too much correction has to be made for 
censoring and there is too little information contributed by "complete" observations. 

[0104] The models may be validated using bootstrapping or by repeating the 
described steps several times. 

[0105] Alternatively, the model may be validated by calculating a concordance 
index (c-index) (Harrell F.E., Lee K.L. and Mark D.B. - Multivariable prognostic models: issues 
in developing models, evaluating assumptions and adequacy, and measuring and reducing errors 
- Statist. Med. 1996; 15:361-387) which describes how all the models can discriminate between 
patients with a different response. 

[0106] This c-index may be calculated on the data set used for model 
development and on a further test data set. If the difference between the two c-indices is small, it 
means that the models do not lose their predictive ability if applied to new data. 

[0107] Further factors which may be taken into accoimt when validating the 
model are the odds ratios determined using clinical cutoffs compared to those determined using 
biological cutoffs. These ratios represent the odds of being a responder in the group that is 
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labeled resistant by the cutoffs divided by the odds of being a responder in the group that is 
labeled sensitive by the cutoffs. In this way it is possible to evaluate how well flie cutoflfe 
perform on a drug by drug basis. The further away that the odds ratio is from 1, the stronger the 
correlation between the resistance class and clinical outcome. The odds ratio as specified above 
yield nimabers smaller than 1 as the odds of being a responder should be smaller in the higher 
resistance class. Put another way, the probability of response will decrease as the resistance 
increases. Additionally, if the odds ratio for the CCO is smaller than the odds ratio for the BCO, 
it can be concluded that the CCO is stronger correlated with clinical outcome and therefore gives 
a better prediction than the BCO. 

[0108] It is also possible to study the difference in response rate for patients 
taking 1 active drug more vs. patients taking one active drug less. This, different type of odds 
ratio is the ratio of odds of response for people taking more active drags over the odds for people 
taking less active drugs. In this case, the odds ratio should be > 1 as the probability of response 
will increase as the number of active drags taken increases. In this case, a larger odds ratio 
indicates a stronger correlation with clinical outcome. In one dataset, using the CCOs set out on 
page 36 herein, the odds ratio for response per additional active drug added was 3.01 when 
calculated using clinical cutoffs, and 2.32 when calculated using biological cutoffs. 

[0109] A logistic regression model may be used to determine the odds ratio. The 
model used is the same as the logistic regression model described above except that the fold 
change in the model is replaced by the resistance class. The coefficient for the resistance class 
that is obtained from the model is the log (odds ratio). The advantage of using this model is that 
the odds ratio estimates can be adjusted for the baseline viral load and the cPSS score of the 
background regimen. 

[0110] A nimiber of embodiments are shown in the following table which gives 
values for lower and upper cutoffs with confidence intervals modelled using definition 2 and a 
linear regression model for several drags, and validated using C-indices, and CCO odds ratios. 



Dmg 


ModeUed Lower Cutoff & 
95% Confidence Interval 


Modelled Upper Cutoff & 95% 
Confidence Interval 


AZT 


1.9 r 1.52-2.761 


14.4 [8.24-21.20] 


3TC 


1.1 [0.98-1.39] 


3.7 [1.71 -11.44] 


D4T 


1.1 ri.05-L121 


2.2 [2.05 - 2.30] 


DDIE 


1.3 [L07-1.34] 


3.0 [2.59 - 2.96] 


ABC 


0.8 rO.75 - L721 


1.2 [1.19-5.11] 


TDF 


1.0r0.97-L321 


2.0 [1.51-2.95] 


NVP 


1.5 ri.40- 16.741 


3.2 [2.22 - 63.06] 


EFV 


1.8 [L41 -3.74] 


29.2 [6.46-146.69] 


IDV 


0.8 [0.77 -1.04] 


2,2 [1.33-7.19] 
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Modelled Lower Cutoff & 
95% Confidence Interval 


Modelled Upper Cutoff & 95% 
Confidence Interval 


IDV/r 


4.1 r0.77--6.24] 


21.2 ri.41- 22.861 


APV 


0.7 rO.65 - 0.87] 


1.4 [1.03-5.551 


APV/r 


0.9 rO.80- 2.721 


6.5 [4.06 - 16.22] 


NFV 


1.0 rO.97- 1.031 


1.5 [1,54-2.381 


SQV 


0.7 [0.65-2.281 


1.0 [1.03-22.611 


SQV/r 


1.1 rO.8 1-5.981 


12.0 [4.50 -28.851 


LPV/r 


103 ri.53- 17.301 


61.6 [21.92 -66.961 



[0111] The methods of the invention can be repeated for each possible drug or 
therapeutic agent known or suspected to be associated with disease resistance, or towards which 
a resistance can be expected to appear. As such, according to another embodiment of the 
invention, the clinical cut-offs generated can be presented as a list of cut-offs against or in 
respect of individual drugs or individual therapeutic agents, for each pathogen. 

[0112] As used herein, the term "drug" includes, but is not limited to, a 
pharmaceutical, bactericide, fungicide, antibiotic, or anticancer, antiviral, anti-bacterial anti- 
fungal, anti-parasitical or any other compound or composition that can be used in therapy or 
therapeutic treatment. 

[0113] A "patient" may be any organisin, particularly a human or other 
mammal, suffering from a disease or in need or desire of treatment for a disease. A patient 
includes any mammal, including farm animals or pets, and includes humans of any age or state 
of development, A group of patients useful to establish treatment response as a function of the 
distribution of fold change resistances maybe as low as 10 to 50 patients, 50 to 500 patients, or, 
more preferably, will comprise a population of 500 or more patients. The distribution fold 
change resistances can be a normal distribution (Gaussian distribution) or can be a non-normal 
distribution. The non-normal distribution may be transformed to obtain a normal distribution. 

[0114] The patient samples may be from treatment naYve or treatment 
experienced subjects, with or without resistance to one or more drugs. 

[0115] As used herein, the term "disease" refers to a disease caused by infection 
with a pathogen. The term "pathogen", as used herein, is used broadly and refers not just to 
pathogenic microorganisms, but includes any disease-causing agent. Examples include bacteria, 
viruses such as human inamunodeficiency vims (HIV), hepatitis C (HCV) or hepatitis B (HBV), 
prions, algae, fimgi, protozoa and malignant cells. Tliis invention is particularly useful for viral 
diseases such as HIV. 

[0116] A "patient sample" is herein defined as any sample obtained from an 
individual suffering from or predicted to be suffering from a disease caused by a pathogen, and 
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includes tissues such as blood, serum plasma, urine, saliva, semen, breast milk, faeces, mucous 
samples, cells in cell culture, cells which may be further cultured, biopsy samples and so on. In 
one embodiment, for a patient infected with HIV, any biological sample-containing virus may be 
used. Of this patient sample, the pathogen itself may be used or alternatively a protein, or nucleic 
acid derived from the pathogen. Preferably, the pathogen is a virus, such as a retrovirus. 
Preferably the biological sample contains a virus chosen from HIV, HCV (Hepatitis C Virus) and 
HBV (Hepatitis B viras). In another embodiment, for a cancer patient, the patient sample may 
contain cells, tissue cells, mutated cells, malignant cells, cancer cells, whole or partial tumours, 
biopsy tissue, etc. Preferably, the pathogeh is a malignant cell. A "reference sample" is defined 
as a standard laboratory reference pathogen such as, for example, in the case of HIV, the HIV 
LAI XnB strain. One strain generally used as the reference *Svild type" sequence for HIV is 
HXB2. This viral genome comprises 9718 bp and has an accession number in Genbank at NCBI 
M38432 or K03455 (gi number: 327742). Reference or wild type sequences for use in the 
invention in the field of specific diseases, infections or diseases caused by specific pathogens can 
be easily obtained from publicly available databases. 

[0117] "Susceptibility" or "sensitivity" to a drug refers to the capacity of the 
disease, and/or pathogen to be affected by the drug. "Resistance" refers to the degree to which 
the disease and/or pathogen is imaffected by the drug. The sensitivity, susceptibility or resistance 
of a disease towards a drug may be expressed by means of an IC50 value. The IC50 value is the 
concentration at which a given drug results in a reduction of the pathogen's growth compared to 
the growth of the pathogen in the absence of a dmg. Resistance of a disease to a drag may be 
caused by aherations in phenotype or genotype. Genotypic alterations include mutations, single 
nucleotide polymorphisms, microsatellite variations, and/or epigenetic variations such as 
methylation. Phenotypic variations may be effected by genotypic variations or by post- 
translational modification. 

[0118] Any method capable of measuring changes in the ability of a pathogen to 
grow in the presence of a drag(s) can be used in the method of the present invention. Such 
methods of phenotyping include all methods known to persons of skill in the art. Known 
genotyping methods may also be applicable. 

[01 1 9] For example, and by way of illustration, methods for phenotyping 
bacteria suitable for use in the present invention include, but are not limited to, measurement of 
inhibitory zone diameters (see, e.g., Guoming et al. Sex Transm. Dis. 27 (2) : 1 15-8 (2000)), 
colorimetric indicator methods (see, e.g., Lozano-Chiu et aL, Diagn Microbiol Infect Dis. 1998 
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Jul;31(3):417-24), and broth macrodilution method (see, e.g., Iwen et al, J. Clin. Microbiol. 34 
(7) : 1779-83 (1996)). 

[0120] As an additional illustrative example, methods for phenotyping 
pathogens suitable for use in the present invention include, but are not limited to, plaque 
reduction assays, PBMC p24 growth inhibition assays (see, e.g., Japour et al^ Antimicrob 
Agents Chemother. 1993 May;37(5): 1095- 101; Kusumi etaL, J. Virol. 66 : 875-885 (1992)), 
recombinant virus assays (see, e.g., Kellam & Larder, Antimicrob. Agents Chemother, 38 : 23-30 
(1994); and Pauwels et al^ 2nd Ihtemational Workshop on HIV Dmg Resistance and Treatment 
Strategies, Lake Maggiore, Italy. Abstr. 51 (1998)); the use of GFP as a marker to assess the 
susceptibility of anti-viral mhibitors (Marschall et al. Institute of Clin, and Mol. ViroL, 
University of ErlangerNuremberg, Schlobgarten, Germany); and cell culture assays (Hayden et 
aU N. Eng. J. Med. 321 : 1696-702 (1989)). 

[0121] Though the invention may be used with any phenotype or genotype 
measuring test or assay that determines resistance, the following descriptions are designed to 
describe further possible applications of the invention. 

[0122] In one embodiment, the clinical cut-off values may be used in concert 
with direct phenotype assays, for example, Antivirogram™ (Virco, Inc.; WO 97/27480, US 
6,221,578). This assay is a phenotypic resistance assay that measures, in controlled laboratory 
conditions, the level of resistance of the HIV derived from an individual patient to each of the 
anti-HTV dmgs currently available. The resistant "behaviour" of the virus may be the combined 
result of the effects of many different mutations and the complex interactions between them, 
including genetic changes that have not even been identified yet. In other words, it is a direct 
measure of resistance. 

[0123] The test provides a quantitative measure of viral resistance to all the 
available drugs. This is expressed in terms of the IC50. This is then compared to the IC50 for fully 
sensitive, non-mutated "wild-type" virus. The resistance of the sampled vims to each drag is then 
expressed in temis of a fold-change in IC50 compared to wild type. 

[0124] The addition of "clinical cut-offs", as described in this application, to the 
report enables physicians to identify the drag(s) that are no longer clinically active and helps in 
the selection of the optimal combination of drugs for the individual patient. In one embodiment, 
the method of the present invention concems a diagnostic tool for determining the resistance of a 
patient to at least one HIV drug comprising the clinical cut-off fold change resistance value for 
said at least one dmg as determined herein. The diagnostic tool includes phenotypic resistance 
tests such as the Antivirogram®, VirtualPhenotyping® and Phenosense. 
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[0125] The invention includes methods to determine resistance towards HIV 
compounds such as tenofovir, lopinavir, and those compounds disclosed in W099/67417, EP-A- 
945443 and WOOO/27825. Other examples of drugs will be well known to those of skill in the 
art- 

[0126] In one embodiment the effect of drugs on HBV may be monitored using 
technologies such as disclosed by Isom et al (WO 99/37821, Delaney et aL Antimicrob. Agents 
Chemotherap. 2001, 45 (6) 1705-1713). 

[0127] In one embodiment the effect of drugs on HCV towards therapy may be 
determined using techniques such as described by Rice (WO 97108310, WO 98/39031) and 
Barthenschlager (EP 1043399). 

[0128] The primary aim of the invention is to predict the resistance of a disease 
to a particular drug. In addition, however, the invention encompasses methods of evaluating 
currently applied drags and thus monitoring these drugs with a view to assessing the 
effectiveness of that drug and proposing altemative drug(s) or optimizing the drug if deemed 
appropriate. Such methods involve obtaining a sample containing a disease-causing pathogen 
firom a patient, and then performing the steps described in any one or more of the embodiments 
of the invention described above. 

[0129] It will be apparent to the skilled reader that while the invention has been 
described in the below examples with respect to viruses, particularly HIV, the present invention 
has broad applicability to any disease state where it is desired to correlate genotypic information 
with phenotypic profiles and assess the threshold at which a fold change resistance is clinically 
significant. One skilled in the art could readily take the following discussion of the invention 
with the HIV virus and through the exercise of routine skill apply this invention to other diseases 
(such as other viral infections, malignant cells, cancer, bacterial infections, other pathogens, and 
the like) to correlate genotypic information to predict phenotypic response, assess drug 
resistance, and eventually develop a treatment regime of drugs for a particular patient. One 
skilled in the art will also know that many virus species comprise many strains; for instance, HIV 
comprises HIV-2 in addition to HIV-1 and both groups are further divided into groups (such as 
groups O and M for HIV-1). 

[0130] The above methods are diagnostic methods. Further aspects of the 
invention provide diagnostic kits for performing any one of the diagnostic methods of the 
invention described above. The invention further relates to a diagnostic system as herein 
described for use in any of the above described methods. 
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[0131] According to yet another embodiment, the present invention relates to a 
diagnostic system for predicting clinical response to a drug of a disease causing pathogen 
comprising: a) means for obtaining a genetic sequence of the disease producing pathogen; b) 
means for identifying at least one mutation in the genetic sequence of the disease producing 
pathogen; c) genotype database means comprising genotype entries; d) phenotype database 
means comprising phenotypes of patient fold change response values; e) clinical response 
database means comprising clinical response to drug treatment data for reference sample 
patients; f) correlation means correlating a genotype entry with a phenotype, where the genotype 
entry corresponds with the obtained genetic sequence of the disease producing pathogen; g) 
means for modeling clinical response to a drug of the disease causing pathogen by determining 
whether the patient fold change response is above a cut-off value, wherein the cut-off value is 
determined using the clinical response database means and comprises the fold change response 
value at which a clinically relevant diminished clinical response is observed; and h) means for 
predicting the clinical response to a drug of a disease by determining whether the patient fold 
change response is above the cut-off value. 

[0132] As described above, the cut-off value is determined as a function of 
treatment response data in treated subjects, considering baseline pathogen load, baseline fold 
change resistance, baseline activity of co-administered drugs targeted to the pathogen, and 
treatment history. The means for predicting the resistance are preferably computer means. 

[0133] A still further aspect of the invention relates to a computer apparatus or 
computer-based system adapted to perform any one of the methods of the invention described 
above. 

[0134] In a preferred embodiment of the invention, said computer apparatus 
may comprise a processor means incorporating a memory means adapted for storing data; means 
for inputting data relating to the genotype exhibited by a particular disease causing pathogen; and 
computer software means stored in said computer memory that is adapted to perform a method 
according to any one of the embodiments of the invention described above and output a 
prediction of the resistance of a disease causing pathogen toward a drug. 

[0135] A computer system of this aspect of the invention may comprise a 
central processing xmit; an input device for inputting requests; an output device; a memory; and 
at least one bus connecting the central processing unit, the memory, the input device and the 
output device. The memory should store a module that is configured so that upon receiving a 
request to model the response to a dmg of a disease causing pathogen, it performs the steps listed 
in any one of the methods of the invention described above. 
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[0136] In the apparatus and systems of these embodiments of the invention, data 
may be input by downloading the data from a local site such as a memory or disk drive, or 
alternatively from a remote site accessed over a network such as the intemet* Data may be input 
by keyboard, if required. 

[0137] The generated results may be ouQ>ut in any convenient format, for 
example, to a printer, a word processing program, a graphics viewing program or to a screen 
display device. Other convenient formats will be apparent to the skilled reader. 

[0138] The means adapted to predict the resistance of a disease causing agent to 
a drag will preferably comprise computer software means. As the skilled reader will appreciate, 
once the novel and inventive teaching of the invention is appreciated, any number of different 
computer software means may be designed to implement this teaching. 

[0139] According to a still further aspect of the invention, there is provided a 
computer program product for use in conjunction with a computer, said computer program 
comprising a computer readable storage medium and a computer program mechanism embedded 
therein, the computer program mechanism comprising a module that is configured so that upon 
receiving a request to predict the resistance of a disease to a drug, it performs the steps listed in 
any one of the methods of the invention described above. 

[0140] The invention further relates to systems, computer program products, 
business methods, server side and client side systems and methods for generating, providing, and 
transmitting the results of the above methods. 

[0141] The invention will now be described by way of example with particular 
reference to a specific system that implements the process of the invention. As the skilled reader 
will appreciate, variations from this specific illustrated embodiment are of course possible 
without departing from the scope of the invention. 
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EXEMPLARY EMBODIMENTS 

Example: Process description of the determination of clinical cut-offs 

Step 1: Clinical Data Base and Analysis Data Set 

[0142] Databases of studies for patients with tenofovir containing regimens and 
consisting of patient baseline demographic characteristics, clinical outcome results with viral 
load and resistance data (Fold change), were retrieved and remapped according to a common 
structure allowing a meta-analysis. The structure consisted of baseline sequence, viral load data 
set, viral load measurements and sampling dates (for example viral load within 3 months of 
starting new regimen and viral load assessment 8 and/or 24 weeks after beginning new regimen), 
CD44- data set which contains CD4+ counts and sampling dates, resistance data set containing 
the fold changes to different antivirals and sampling dates; patient data set with patient 
information such as age, gender, race, treatment history; treatment data set with drug regimens, 
start and stop dates, doses, formulations, frequency of intake, regimen changes after resistance 
tests. The stmcture of such a clinical data base can be seen in figure la. The following table 
shows an example of characteristics of analysis datasets (8 week outcome) for individual drugs. 





Range (Drug) 


Median Baseline Viral Load (log) 


3.32 (TDF) ~ 4-71 (boosted IDV) 


Median background cPSS 


1.34 (ddC) ~ 2.58 (LPV/r) 


# regimens including the drug 


24 (unboosted APV) - 1551 f3TO 


% from cohort data 


21% (unboosted APV) 83% (ddl-EC) 


% with no resistance mutations 


14.5% (boosted APV) - 75% (EFV) 



[0143] Figure lb shows an example analysis dataset for d4T. The viral load 
response data includes censored (<400 copies/ml) values. Parameters of the underlying 
uncensored distribution were estimated by maximum likelihood estimation in SAS (proc lifereg). 
The viral load response data are corrected for baseline viral load and cPSS. 

Step 2: Modelling 



[0144] The clinical outcome results (drop in viral load and response rate) were 
modelled as a function of baseline fold change (FC) as determined by virtual phenotype (see 
WOOl/79540 and WO02/33402; also http://www,vircolabxom). The models applied were linear 
regression, logistic regression, and a classification tree. These models also took into account 
effects of the concomitant HIV drugs (PSS), baselme viral load (Baseline Log(Vl)i) and, 
optionally, treatment history in order to avoid bias introduced by imbalances of important 
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characteristics. From the models, a prediction of clinical outcome could be made at different 
levels of the baseline fold change resistance. 

[0145] In the linear regression model, the proposed equation was the following: 

LogVL dropi = pO + pi*Baseline Log(Vl)i -i- p2 *PSSi + p3 *(l/FC)i + §i 
where i represented the patient, PO the intercept, pi, p 2 and p3 coefScients indicated the 
increase in log viral load drop per unit increase of respectively the baseline log VL, the number 
of sensitive drugs in the backgroimd regimen and the inverse of the baseline fold change, was a 
random error term indicating the deviation of the patient from the value predicted by the model. 
Interactions between all the factors were evaluated and other baseline characteristics, i.e 
treatment history, were added if relevant. After applying the regression model, the curve as 
depicted in Figure 2a was obtained. 

[0146] Example curves showing linear regression models of 8 week viral load 
response as a function of baseline phenotypic resistance are shown in figures 2b and 2c. The 
curves shown are for subjects with a log baseline viral load = 4.0 and a cPSS score for the 
background regimen of 2.0. Figure 2b shows models for nucleoside(tide) RT inhibitors (from top 
to bottom at FC=5, the respective curves represent d4T; TDF; ddl, EC; ABC; 3TC and AZT). 
Figure 2c shows models for boosted and unboosted protease inhibitors (from top to bottom at 
FC=10 the respective curves represent: top left graph - NVP and LPV/r; top right graph - IDV 
and IDV/r; bottom right graph - SQV/r and SQV; bottom left graph - APV/r and APV). 

[0147] A fundamental issue with modeling cUnical outcome for non- 
nucleoside(tide) RT inhibitors is that the baseline fold change may have little effect on treatment 
response to current NNRTIs in NNRTI experienced patients. For the NNRTI of neviripine, 
polarisation of fold change values is observed (see figure 2d). Furthermore, an extremely broad 
dispersal of fold change values is observed for efavirenz (see figure 2e). 

[0148] In a preferred embodiment of the linear regression model, more factors 
are included in order to obtain a more refined prediction of vfral load response. These factors 
include for example a sensitivity score per dmg class in addition to the overall sensitivity score 
of the background treatment (cPSS), previous exposure to the drag (naive (Yes/No), naive to Pis, 
naive to NRTI's, etc. .„). Furthermore, in the preferred embodiment, the fold change is 
transformed before inserting the figures into the model. The transformation to the fold change 
comprises a power-transformation ranging from FC'^ to FC*. In addition, a quadratic term in 
cPSS is preferably added. 
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[0149] Accordingly, a more general form of the equation given above may be 

expressed as: 

wherein p is a power transformation (e-g. ranging from -3 to 1) and H5 to H„ are treatment history 
parameters (e.g. naJve to antiretroviral therapy, nalVe to NRTI treatment, etc. ...) or parameters 
describing the background therapy as a function of a certain therapeutic class (e.g. the number of 
active NRTI's taken concomitantly with the dmg under investigation). 

[0150] Example curves showing linear regression models as calculated using 
this model are shown in Figures 2f and 2g. Here, the presentation of the data is different in tiiat 
the % response is plotted, as calculated for the preferred CCO definition, rather than the viral 
load drop. 

[0151] In the logistic regression, the proposed equation was the following: 

Prob of success = ^xp(^o ^^Log(BaselineVL, ) + jPSS,) ^ A (1/ i^Q)) 

il-^oxp(jS^+jS,LogiBaselmeVL,)-hj32(PSS,)'hjS^(l/FC,^^^ ' 

where pi, P2, and p3 represented the log odds ratio of success for the corresponding factors in 
the model. After applying the logistic regression model, the curves as depicted in Figures 5 a and 
7 were obtained. In the classification trees model, tree-structured rules were constructed in order 
to classify patients in successes (undetectable viral load after treatment) and failures. The same 
parameters as for the other techniques were considered. The tree shown in Figure 8 was obtained 
after applying the classification tree model. 

[0152] When viral load results under the detection limits are obtained, biases 
could be introduced if the detection limit values are considered when calculating viral load drops 
and using those in the linear regression model. To avoid this, censoring needed to be taken into 
account and therefore the PROC LIFEREG facility in the S AS package was employed. 

[0153] An advantage of this regression model is that it takes into account the 
maximum amount of information present in the data, i.e. correlating specific clinical responses 
with specific Fold changes while the other two models clusters the patients in two groups 
(successes versus failures), thus not taking into account differences in responses within the same 
group. Estimates are corrected for covariates in the model (e.g. background regimen) and 
therefore, they do not suffer fi-om imbalances in the covariates. Conclusions are limited to 
patients with covariates that are represented in the clinical database, 

[0154] Logistic regression does not suffer from the censoring problem and the 
probability of success is an intuitive way of interpreting clinical outcome. However, by biiming 
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the viral load into successes and failures, part of the information of the continuous variable viral 
load is lost. Estimates are also corrected for covariates as for linear regression. 

[0155] Classification trees are very visual and easy to interpret, but they have 
the disadvantage that the decisions do not take into account the value of other relevant 
parameters. This implies that imbalances for other parameters may influence the decision taken 
for a certain parameter. However, they provide insights iii the importance of several parameters 
and this can be helpful in the fitting process of the other approaches. Figures 8a, 8b and 8c show 
examples of classification trees. Figure 8b shows a classification tree for 3TC, whilst figure 8c 
shows a classitication tree for TDF assuming that the cost of classifying a failed regimen as a 
success is 1.5 times the cost of classifying a successfiil regimen as a failure. 

Step 3: Determination of the clinical cut-off, 

[0156] Clinical responses were predicted in the models developed in previous 
step 2, The approach is to identify two cutoffs per drug: a "lower" cutoff which represents the 
fold change at which the response begins to be lost, and an "upper" cutoff which represents the 
fold change at which the response is essentially gone, la order to determine the fold changes at 
which clinically relevant diminished clinical responses can be observed, three definitions of 
clinical cut-offs were considered: 

Definition 1: 

[0157] Sensitive: predicted viraf load drop is more than 0.6 logs. 

[0158] Intermediate resistant: predicted viral load drop is between 0.2 and 0.6 

logs. 

[0159] Resistant: predicted viral load drop is less than 0.2 logs. 

[0160] The lower and higher cutoffs are defined as the fold change with 
expected log viral load drops of ^0.6 and <0.2 respectively. 

[0161] This definition of clinical cut-off addresses the potency of an entire 
combination regimen and is highly dependent on the characteristics of the specific patient 
regimens analysed. 

Deflnition 2: 

[0162] The maximxmi effect was defined as the treatment effect at fold change 
1, and the minimum effect was defined as the treatment effect at a very high fold change (i.e. 
when the curve reached a plateau). The effect range was then the difference between the 
maximum effect and the minimum effect. 
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[0163] Sensitive: the predicted treatment effect is more than 80% of the effect 

range. 

[0164] Intemiediate resistant: the predicted treatment effect is between 20% and 
80% of the effect range. 

[0165] Resistant: the predicted treatment effect is less than 20% of the effect 

range. 

[0166] The lower and higher cutoffs are defined as the fold change associated 
with an expected 20% and 80% decrease respectively of the reference activity of the drug within 
the regimen. 

[0167] Cutoffs obtained using definition 2 do not address the potency of the 
entire treatment regimen, but rather give an estimation of the activity of the drug within the ., 
regimen. The absolute magnitude of the viral load drop depends on specific covariates. 

Definition 3: 

[0168] Definition 3 was a variant of definition 2. The lower cut-off was defined 
as the fold change that most optimally distinguished patients between successful and 
unsuccessful treatments. 

[0169] Using definition 3, breakpoints determined by classification trees are 
applicable only to a subset of patients unless fold change is selected at the first tree node. 
Classification trees are easy to interpret but the cutoffs do not indicate the magnitude of the viral 
load reduction expected for the whole regimen or the drug within the regimen. Breakpoints 
determined by the linear and the logistic regression models are close to the lower cutoffs as 
defined by definition 2, except for the boosted protease inhibitors. 

[0170] The methodology was applied for Tenofovir on a population taking two 
active drugs besides tenofovir and with a baseline Log(Vl) of 4. 

[0171] When we applied definition 1 on the linear regression model, the 
observed drop in log viral load was -0.6 at fold change 3.73 (fig 3a). No higher cut-off could be 
derived as this population experienced a drop in Log(VL) greater then 0.2 even with a high 
baseline fold change for tenofovir. This could be explained by the effect of the active 
backgroimd regimen in this population. 

[0172] When we applied definition 2 on the linear regression model (fig 4a), the 
observed drop in log viral load was -1.48 at fold change 1, and -0.28 at the assymptotic fold 
change. Therefore the effect range was -0.28 + 1 .48 =1 .2. 
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[0173] 20% of this effect range was observed at fold change 5 (and this value 
was considered as the upper clinical cut-off value), 

[0174] 80% of the effect range was observed at fold change 1 ,25 (and this value 
was considered as the lower clinical cut-oflF value), 

[0175] To predict the resistance according to this regression model, we 
determined whether the patient fold change resistance was above, below, or in between the 
clinical cut-off as calculated according to definition 2. So, when the FC of patient was of 0.8 
(below the lower clinical cut-of3F), a normal clinical response was predicted. If the FC of the 
patient was of 2 (above the lower clinical cut-off and below flie upper clinical cut-off), a reduced 
clinical response was predicted. If the FC was of 7 (above the clinical cut-off), then the clinical 
response was predicted as being minimal. 

[01761 Definition 2 was also applied to the logistic model (fig. 5 a) and this 
resulted in a lower cut-off at 1.2 FC and a higher cut-off at 3.81 FC.The results for definition 3 
are depicted in fig 6 and 7. 

[01771 The Tenofovir results for the population with 2 active drugs in the 
regimen and a baseline Log(VL) of 4 are summarized in the Table below. 

[01781 From the Table, it can be derived that the lower cut-off for definition is 
1.2 and the higher cut-off ranges firom 3.81 to 5 for the population of patients taking 2 active 
drugs besides tenofovir and with a baseline log(VL) of 4. The variation in cut-offs determined by 
the different definitions is a "result of the different influence of the covariates such as PSS and log 
VL. That is, the influence of the covariates is significant when using definition 1 and less 
significant when using definition 2. 

[0179] Definition 1 can only be applied on the linear regression model. The 
clinical cut-offs determined using definition 1 are highly dependent on the characteristics of the 
subpopulation. This is due to the fact that definition 1 describes the potency of the whole drag 
regimen while definition 2 is related only to the activity of the drug mider consideration and its 
resistance profile. In other words, the activity of the background regimen together with the drug 
under investigation determines the viral load drop that the patient will experience and hence the 
dependence of the cut-off on the background regimen. The activity of the background regimen 
does not change the resistance profile in a profound way, therefore the cliiucal cut-offs do not 
vary considerably with the population characteristics. 
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Definition of 






Linear 
Regression 


Logistic 
Regression 


Classification 
Tree 


Clinical 
cut-ofr 


Population 


Properties of the subsroup 


Lx)wer 
CO 


Higtier 
CO 


Lower 
CO 


Higher 
CO 


Lower 
CO 


Higher 
c*r\ 


Definition J 


Subgroup J 


PSS-2, baseline Log(VL) = 4 


/ J 


^ assay limit 


InA 


NA 


NA 


NA 






PV^zs/t ht9^^l!m*a /am-/'I/J^1 M 

roO'^U, Oaseiine lJOg(vL.J — 4 


I.oo 


3.8 


NA 


NA 


NA 


NA 




Subgroup 3 


f^Ci—j^, oasetine Log(vL) =^ 3 


> assay limit 


> assay limit 


NA 


NA 


NA 


NA 




Overall 




NA 


NA 


NA 


NA 


NA 


NA 


Definition 2 


Subgroup J 


PSS=-2, baseline JLog(VL) = 4 


1.25 


5 


1.2 


3.81 


NA 


NA 




Subgroup 2 


PSS^O, baseline Log(VL) = 4 


1.25 


5 


1.16 


3.36 


NA 


NA 




Subgroup 3 


PSS='2, baseline Log(VL) « 5 


1.25 


5 


1.17 


3.4 


NA 


NA 


Definition 3 


Overall 




1.25 


5 


NA 


NA 


NA 


NA 


Subgroup I 


PSS=2, baseline Log(VL) ^ 4 


I.l 


5 


1.2 


3.81 


1.15 


NA 




Subgroup 2 


PSS^O, baseline Log(VL) = 4 


l.l 


5 


1.2 


3.36 


1.15 


NA 




Subgroup 3 


PSS^2, baseline ljog(VL) = 5 


1.1 


5 


1.2 


3.4 


1.15 


NA 




Overall 




1-1 


5 


1.2 


NA 


1.15 


NA 


NA: Not Applicable 







[0180] Both linear and logistic regression models give similar results for 

definition 2, 

[0181] When we applied definition 1 on the linear regression model for patients 
with a log baseline viral load of 5 and all patients taking two active drugs in addition to 
d4T(stavudine), a viral load drop of more than 0.6 log copies/mL for any fold change of d4T is 
predicted (figure 3b). The viral load drop is predicted to be -0.6 logs and -0.2 logs at fold 
changes 2.6 and 4.0 for patients with a log basehne viral load of 5 and taking no active drugs in 
addition to d4T (see figure 3b). 

[0182] Lower and upper cutoffs determined using definition 2 for lopinavir/r are 
shown in figure 4b as 8 and 69 respectively for the whole population if viral load is modeled 
using linear regression, while the lower and upper cutto& are 1 1 and 64, 10 and 60, and 9 and 58 
respectively for populations with log baseline VL/background PSS of 4/2, 5/0 and 5/2 
respectively if the failure rate is modeled using logistic regression (see figure 5b). The PSS is not 
statistically significant in this example. 

[0183] ILower and upper cutoffs determined using definition 2 for boosted 
saquinavir for the logistic model were 1.7 and 13.2, and 1,7 and 12.9 respectively for populations 
with log baseline viral load/phenotypic sensitivity score for the background regimen of 4/2 and 
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5/0 respectively. Lower and upper cutofig by Unear regression for saquiavir/r were 1.6 and 12.3 
respectively for the whole population. 

[0184] Lower and upper cutoifs determined using definition 2 for AZT are 
shown in figures 4c and 4d if viral load is modeled using linear regression. Figure 4c shows the 
change in viral load vs. the fold, change whilst figure 4d shows the % loss of reference response 
vs. the fold change. 

[01851 Further tests with records for >1 3,000 patients yielded -3 1 50 regimens 
with the required baseline and outcome variables, ranging &om 60 regimens including boosted 
SQV soft gel to 1546 including 3TC. Median log baseline viral load ranged fi^om 3.8 (regimens 
with tenofovir) to 4.7 (regimens with boosed indinavir). Median PSS of background regimens 
was 2 (range 0-7). Hie following table shows results of preliminary VirtualPhenotype™ Clinical 
Cutoffs for nucleosides(tides) and boosted and unboosted protease inhibitors from a preliminary 
linear regression analysis. The fold change values associated with a 20% diminution of virologic 
response at 8 weeks and an 80% diminution of virologic response compared to the maximal 
response are shown with a 95% confidence interval in square brackets. 
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DRUG 


KIHrK4£PHENOTYPE™ 
PREDICTED FC OF WILD TYPE 
CLINICAL ISOLATES 


VmOLOGIC RESPONSE "] 


20% 


80% 1 
REDUCTION [ 


AZT 


-0.8 


1.8 

11. 5-2.51 


17 

[10-25] 


3TC 

1 a mi vn Ti A 
xciaxu V m til IP 


0.8 


1.1 
L1-1-1.2J 


2.6 
[1.9-4.61 


D4T 
staviiflitif 


0.7 


1.3 
[1.2-1.41 


3.4 
[3.1-3.6] 


ddl 

uivuiuuaiiic 

(extended 


0.6 

• 


1.3 
[1.2-1.9] 


3.6 
[2.8-4.9] 


ABC 
abacavir 


0.6 


1.6 

1 1.1-Z-Oj 


5.8 
[1.7-7.41 


TDF 


0.8 


1.2 


2.5 
[1.7-3.81 


IDV 
indinavir 


0.7 


1.2 

[1.1-1.91 


3.4 
[1.9-16.4] 


rVD/r 

Indinavif/r 


3.5 
[1.1-8.41 


25 
[1.8-311 


NFV 
nelfinavir 


0.9 


1.1 
[1.1-1.31 


2.2 
ri. 7-5.3] 


SQV 
saquinavir 


0.6 


1.1 
[1.1-2.11 


2.0 
[1.7-18] 


SQV/r 
1 Saquinavir/r 


1.6 
[1.3-4.81 


12 

. [5.8-27] 


AMP 
amprenavir 


0.6 


1.2 
[1.1-2.41 


3.4 
[1.7-10.2] 


AMP/r 
amprenavir/r 


1.5 
[1.2-2.61 


6.-8- 
[3.6-10.5] 


LPV/r 
Lopinavir/r 


0.8 


6.9 
[2.1-17.41 


56 
[29-671 



[0186] Similar values were determined in logistic regression models. While the 
magnitude of the virologic response for individual patients is affected by covariates such as viral 
load and PSS, FC values associated with fractions of the effect range are not. 

[0187] Further test results showed that the cUnical cutoffs for 20% reduced 
response after 8 weeks were low (1.1 to 1.2 for unboosted Pis), but higher than Virco type 
predicted fold changes for wild type clinical isolates(0.73 (ABC) to 1.07 (AZT) for IsrRTIs, 0.63 
(APV) to 0.87 (NFV) for Pis). Clinical cutoffs for 80% reduced response were 3.4(APV, IDV) 
for unboosted Pis. Chnical cutoffs for boosted Pis were higher: 1.5 (APV/r) for 20% reduction 
and 6.8 (APV/r) for 80% reduction. Among the treatment regimens analysed, sensitivity classes 
defined by these clinical cutoffs showed different rates of virologic response (viral load drop of 
more than 1 .0 log for BQL at 8 weeks) to regimens including the drug: 70 to 92% for fold 
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changes less than the lower clinical cutoff, 39 to 68% for fold changes between the upper and 
lower clinical cutoffs, and 18 to 50% for fold changes above the upper clinical cutoff, 

[0188] In another example the following linear regression model is used for d4T 
with clinical cutoff definition 2 when Log(BaselineVL) = 4, lower reference fold change =0.9 
and upper reference fold change=3, cPSS=2, PSS[1SIRTI]=1 and NRTI[naive]=0, using the 
following linear regression: 

VLdrop = 2.91 - 0,63Log(BaselineVL) - 1,66(FC-^'^) ~ 0.99(cPSS) + 0.15(cPSS^) 

- 0A8(PSS[NRTI]) + 0.91(NRTI[naive]). 

[0189] The NRTI[naYve] value represents whether the patient is naive to 
nucleside RT inhibitors. If yes, value =1, if no, value =0. The PSS[NRTI} represents the 
phenotypic sensitivity score for NRTIs, i.e the number of active NRTIs in the background 
regimen for the patient. 

[0190] Stage 1 : Calculate VL drop at the lower reference FC (drug is fully 
active) and the VL drop at the upper reference FC (minimal activity of the drug). This leads to a 
lower VL drop of -2.94 and an upper VL drop of -2.03. 

[0191] Stage 2: Calculate VL drop when dmg has lost 20% of its activity and 
the VL drop when drag has lost 80% of its activity. This gives values of VL drop=-2.76 for 20% 
and -2.2 1 for 80% loss of activity. 

[0192] Stage 3: Determine Fold change equivalent to VL drop at 20% and 80% 
loss of activity by inserting values for VL drop into the above equation and calculating FC. This 
gives FC values of 1.08 and 2.18. Consequently, the lower and upper clinical cutoff values for 
d4T are modeled as 1.1 and 2.2 respectively. 

Step 4: Validation of the cut-offs 

[0193] The models were validated using bootstrapping and repeating the steps 
described above several times. Bootstrapping is a resampling technique in which pseudo- 
populations of the same size as the original population are created by randomly drawing samples 
from the original population. Analysis of each of these populations gives a sense of the sampling 
variability of the clinical cut-off. 

[0194] Figure 9 shows initial validation of preliminary F/r^i/a/Phenotype™ 
Clinical Cutoffs for nuclesides(tides) and boosted and unboosted protease inhibitors by 
bootstrapping with 90% confidence intervals. 

[0195] The problem is tackled from different points of view in order to assess 
the robustness of the analysis results. The clinical cut-offs obtained could be ftirther refined by 
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adding more data sets and by taking more characteristics of the patients into account. The 
clinical-cutoffs obtained could also be further refined by performing the model on unseen data. 

[0196] An altemative method of validation of the model includes calculation of 
a concordance index (c-index) which describes how all the models can discriminate between 
patients with a different response. This c-index is calculated on the data set used for model 
development and on a test data set. If the difference between the two c-indices is small, it means 
that the models do not lose their predictive ability if applied to new data. The following table 
contains infomiation regarding the validation procedure for each drug tested when modelled 
using the preferred embodiment of linear regression which includes the additional factors of 
sensitivity drug per class in addition to the overall sensitivity score of the backgroimd treatment 
and previous exposure to the drug (e.g. naive, naive to Pis, naive to NRTIs). The test data c- 
index and the validation data c-index columns relate to the c-indicies for the original test data set 
and the new data set for validation purposes. The modeled lower and upper values of clinical 
cutoff are quoted with corresponding confidence intervals. 



Drug 


Test Data 


Validation Data 


Modelled 
Lower Cutoff & 
Confidence 
Interval 


Modelled Upper 
Cutoff & 
Confidence 
Interval 


C- 
ittdex 


ceo 

Odds 
Ratio 


BCO 
Odds 
Ratio 


C- 
Index 


ceo 

Odds 
Ratio 


BCO 
Odds 
Ratio 


AZT 


0,79 


0.065 


0.107 


0.78 


0.252 


0.316 


1.9 r 1.52^2,76] 


14.4 [8.24-21.201 


3TC 


0.79 


0.160 


0.202 


0.79 


0.265 


0.284 


1.1 rO.98- 1.39] 


3.7 [1.71 - 11,441 


D4T 


0,77 


0.113 


0.239 


0.76 


0.366 


0.377. 


1.1 riM5- 1.121 


2.2 [2.05 - 2.301 


DDIE 


0.73 


0.219 


0.379 


0.71 


0.119 


0.159 


1.3 [1.07-1.341 


3.0 [2.59-2.961 


ABC 


0.70 


0.651 


0.359 


0.69 


0.544 


0.380 


0.8 [0.75 - L72J 


L2 [1.19-5.111 


TDF 


0.7 J 


0.228 


0.570 


0.68 


0.226 




1.0 [0.97-1.321 


2.0 [1.51 -2.951 


NVP 


0.79 


0.282 


0.290 


0.83 


0.054 


0.054 


1.5 [1.40- 16.74] 


3.2 [2.22-63.061 


EFV 


0.79 


0.079 


0.099 


0.79 


0.041 


0.042 


1.8 [1.41 -3.741 


29.2 [6.46- 146.691 


IDV 


0.76 


0.098 


0.194 


0.79 


0,270 


0.598 


0.8 [0.77-1.041 


2.2 [1.33-7.191 


mv/r 


0.71 


0.027 


0.122 


0.63 


0.012 


0.038 


4.1 [0.77-6.241 


. 21.2 [1.41 -22.861 


APV 


0.86 


0.051 


0.061 








0.7 [0.65 - 0.871 


1.4 [1.03 - 5.551 


APV/r 


0.73 


0.005 


0.053 








0.9 [0.80-2.721 


6.5 [4.06- 16.22] 


NFV 


0.76 


0.177 


0.291 


0.77 


1.068 


0.938 


LO [0.97 - 1 .031 


1.5 [1.54-2.38] 


SQV 


0.77 


0.332 


0.431 


0.56 






0.7 [0.65 - 2.281 


LO [1.03-22.611 


SQV/r 


0.71 


0.165 


0.249 


0.81 


0.054 


0.080 


LI [081-5.981 


12.0 [4.50-28.851 


LPV/r 


0.70 


0.074 


0.240 


0.75 


0.002 


0.277 


10.3 [1.53- 17.301 


61.6 [21.92-66.961 



[0197] The columns headed CCO Odds Ratio and BCO Odds Ratio represent 
the odds of being a responder in the group that is labeled resistance by the cutoffs divided by the 
odds of being a responder in the group that is labeled sensitive by the cutoffs. For example, if 
there are four non-responders and one responder in the resistant group, and fovu- responders and 
two non-responders in the sensitive group, the odds of being a responder in the resistant group is 
1 :4 = 0.25 and the odds of being a responder in the sensitive group is 4:2 = 2. This means that the 
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odds ratio is 0.25/2 = 0,125. In other words the odds of being a responder in the resistant group 
are 0,125 times the odds of being a responder in the sensitive group. The further away that the 
odds ratio is fix>m 1, the stronger the correlation between the resistance class and clinical 
outcome, 

[0198] The separation in groups (resistance/sensitive) can be done based on 
clinical cutoflFs or biological cutoffs. This means that odds ratios based on clinical cutoffs (CCO) 
and odds ratios based on biological cutoffs (BCO) can be compared. If the odds ratio for the 
CCO is smaller than the odds ratio for the BCO, it can be concluded that the CCO gives a better 
prediction than the BCO. 

[01991 In practice, a logistic regression model is used to detemiine the odds 
ratio. The model used is similar to logistic regression model described earlier except that the fold 
change in the model is replaced by the resistance class. The coefBcient for the resistance class 
that is obtained from the model is the log (odds ratio). The advantage of using this model is that 
the odds ratio estimates can be adjusted for the baseline viral load and the cPSS score of the 
background regimen. In one dataset, using the CCOs set out on page 36 herein, the odds ratio 
for response per additional active drug added was 3.01 when calculated using clinical cutoffs, 
and 2.32 when calculated using biological cutoffs. These odds ratios are ratios of odds of 
response for people taking more active drugs over the odds for people taking less active drugs. 
The<odds ratio here should be > 1 as the probability of response will increase as the number of 
active drugs taken increases. In this case, a larger odds ratio indicates a stronger correlation with 
clinical outcome. 

[0200] The disclosures of each patent, patent application and publication cited 
or described in this document are hereby incorporated herein by reference in their entirety. 

[0201] Those skilled in the art will appreciate that numerous changes and 
modijScations can be made to the embodiments of the invention and that such changes and 
modifications can be made without departing from the spirit of the invention. It is, therefore, 
intended that the appended claims cover all such equivalent variations as fall within the true 
spirit and scope of the invention. 
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What is Claimed: 

1 . A diagnostic method for estimating for a patient the treatment response of a disease 
caused by a pathogen to a dmg, the method comprising: 

comparing the fold change resistance value of the pathogen infecting the patient to a 
clinical cut-off value which is the fold change resistance value at which a clinically relevant 
variation of clinical response is observed; 

wherein the clinical cut-off value is established by modeling the clinical response of a 
population of patients treated with the drag to the disease caused by the pathogen as a function of 
the fold change resistance of the pathogen infecting the patients. 

2. A method according to claim 1, wherein the cut-off value is determined as a function of 
treatment response data in treated subjects, considering baseline pathogen load, baseline fold 
change resistance and baseline activity of coadministered drugs targeted to the pathogen. 

3. A method according to claim 1, wherein the cut-off value is calculated by reference to the 
pathogen load drop. 

4. A mettiod according to claim 3, wherein the cut-off value is calculated by reference to the 
log pathogen load drop. 

5. A method according to claim 4, wherein the log pathogen load drop is calculated by 
performing a linear regression analysis usingdata from a dataset of treatment response data, 
wherein the log pathogen load drop LogPL dropu for the pathogen infecting a patient i, is 
modelled as the sum of all of the individual contributions for factors that influence pathogen load 
drop, according to the following equation: 

LogPLdrop,== fi^^ /3,Log{BaselinePL,)'¥ fi^iPSS,)^ fi^il/ FC,^ 

wherein BaselinePU represents the pathogen load of the patient measured at the start of 
treatment by the drug, 

PSSi is a phenotypic sensitivity score representing the number of active drugs in the 
background treatment regimen for the patient, excluding the drag whose contribution to 
treatment response is being modelled, 

FCi is a baseline fold change resistance, 

Po is the intercept. 

Pi is a coefficient representing the increase in log pathogen load drop per xmit increase of 
the log of the BaselinePLu 

P2 is a coefficient indicating the increase in log pathogen load drop per unit increase of 
the number of sensitive drags in the background treatment regimen, 
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is a coefficient indicating the increase in log pathogen load drop per unit increase of 
the inverse of FCu 

and wherein the error term, e/, represents the difference between the modelled prediction 
and the experimentally determined measurement. 

6. A method according to claim 4, wherein the log pathogen load drop is calculated by 
performing a linear regression analysis using data j&om a dataset of treatment response data, 
wherein the log pathogen load drop LogPL drop,, for the pathogen infecting a patient i, is 
modelled as the sum of all of the individual contributions for factors that influence pathogen load 
drop, according to the following equation: 

wherein the terms of the equation are the same as those given in claim 5, and 
additionally, p is a power transformation (e.g. ranging from -3 to 1) and Hs to Hn are treatment 
history parameters or parameters describing the background therapy as a function of a certain 
therapeutic class. 

7. A method according to claim 1, wherein the cut-off response value is calculated by 
reference to the probability of the pathogen being susceptible to treatment by the drug for the 
patient, herein termed Prob of success. 

8. __ A method according to claim 7, wherein Prob of success is calculated by performing a 
logistic regression analysis using data from a dataset of treatment response data, wherein Prob of 
success is modelled according to the following equation: 

Prob of success = ^xpC^gp + fi.LogiBaselinePL,)^ fi^ (PSS,) + J3,(1/FC,)) 

(1 + oxpiJS^ + fi.LogiBaselinePL^ ) + yff^ {PSS^) + /?3 (1 / FC, ))) 

wherein BaselinePLi represents the pathogen load of the patient measured at the start of 
treatment by the drug, 

PSSi is a phenotypic sensitivity score representing the niunber of active drugs in the 
background treatment regimen for the patient, excluding the drug whose contribution to 
treatment response is being modelled, 

FCi is a baseline fold change resistance, 

Po is the intercept. 

Pi is a coefficient representmg the increase in log pathogen load drop per unit increase of 
the log of the BaselinePLu 

p2 is a coefficient indicating the increase in log pathogen load drop per unit increase of 
the number of sensitive dmgs in the background treatment regimen, and 
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Ps is a coefficient indicating Ihe increase in log pathogen load drop per unit increase of 
the inverse of FCi. 

9. A method according to claim 1, wherein the cut-off fold change resistance value is 
calculated by reference to the likelihood of a patient achieving treatment success or failure, 
where a definition of success is having an undetectable pathogen load after treatment with a 
particular drug, using a classification tree. 

10. A method according to claim 9, wherein tiie clinical cut-ofif is defined as the fold change 
resistance threshold value that makes the best distinction between the population with successfiil 
treatments and the population with unsuccessfiil treatments. 

11. A method according to any one of the preceding claims, wherein the baseline fold change 
resistance is determined by comparing the genotype of the disease causing pathogen to 
phenotype data collected firom a group of patients infected with a pathogen of similar genotype. 

12. A method according to claim 1 1 , wherein the baseline fold change resistance is 
determined using the Virtual Phenotype system, or a variation thereof. 

13. A method according to claim 1, that incorporates two or more of the methods recited in 
claims 5, 6, 8 and 9. 

14. A method according to any one of the preceding claims which is a computer- 
implemented method. 

15. A method according to claim 14, which is an automated method. 

16. A method according to any one of the preceding claims, wherein the disease causing 
pathogen is obtained firom a patient sample chosen firom a blood sample, a biopsy sample, a 
plasma sample, a saliva sample, a tissue sample, and a bodily fluid or mucous sample. 

17. A method according to any one of the preceding claims, wherein the disease causing 
pathogen is a virus. 

18. A method according to claim 17, wherein the disease causing virus is chosen from HIV, 
HCV and HBV. 

19. A method according to any one of the preceding claims, wherein the method is performed 
for a number of candidate drugs so as to provide information on the predicted fold resistance 
exhibited by the pathogen to a spectrum of candidate drags. 
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20. A diagnostic method for optimising a drug therapy in a patient, comprising performing a 
method according to any one of the preceding claims for each dmg or combination of drugs 
being considered to obtain a series of dmg resistance phenotypes and therefore assess the effect 
of the plm-ality of drugs or drug combinations on the pathogen with which the patient is infected 
and selecting the dmg or drug combination for which the pathogen is predicted to have the 
lowest fold resistance. 

21 . Use of a method according to any one of the preceding claims for assessing the efficiency 
of a patient's therapy or for evaluating or optimizing a therapy, 

22. A diagnostic system for predicting clinical response to a dmg of a disease causing 
pathogen comprising: 

a) means for obtaining a genetic sequence of the disease producing pathogen; 

b) means for identifying at least one mutation in the genetic sequence of the disease 
producing pathogen; 

c) genotype database means comprising genotype entries; 

d) phenotype database means comprising phenotypes of patient fold change response 

values; 

e) clinical response database means comprising clinical response to dmg treatment for 
reference sample patients; 

f) correlation means correlating a genotype entry with a phenotype, where the genotype 
entry corresponds with the obtained genetic sequence of the disease producing pathogen; • 

g) means for modelling clinical response to a dmg of the disease causing pathogen by 
detemiining whether the patient fold change response is above a cut-oflf value, wherein the cut- 
off value is determined using the clinical response database means and comprises the fold change 
response value at which a clinically relevant diminished clinical response is observed; and 

h) means for predicting the clinical response to a dmg of a disease by determining 
whether the patient fold change response is above the cut-off value. 

23. A diagnostic system according to claim 22, wherein the cut-off value is determined as a 
function of treatment response data in treated subjects, considering baseline pathogen load, 
baseline fold change resistance, baseline activity of co-administered dmgs targeted to the 
pathogen and treatment history. 

24. A computer apparatus or computer-based system adapted to perform the method of any 
one of claims 1-19. 
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25. A computer program product for use in conjimction with a computer, said computer 
program comprising a computer readable storage medium and a computer program mechanism 
embedded therein, the computer program mechanism comprising a module that is configured so 
that upon receiving a request to predict the response of a disease caused by a pathogen to a dmg 
it performs a method according to any one of claims 1-19. 
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ABSTRACT 

Methods and systems for improving the accuracy of predicting resistance of a disease to a 
dmg are described. More specifically, methods for assessing the impact of pre-existing variations 
in drug susceptibility^ whether naturally occumng or selected by previous drug exposure, on 
treatment response in order to establish clinically relevant cut-off values for phenotypic or 
genotypic resistance tests are described. 
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Definition 1 applied on drop in Log(VL) (linear regression) 
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Definition 2 applied on drop in Log(VL) (linear regression) 
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Definition 2 applied on the probability of failure (=1 /Probability of success) (logistic regression) 
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Figure 6 



Definition 3 applied on drop in Log(VL) (linear regression) 
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Figure? 



Definition 3 applied on the probability of failure (=1 /Probability of success) (logistic regression) 
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